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Abstract — In this two-part series of papers, a generalized 
non-orthogonal amplify and forward (GNAF) protocol 
which generalizes several known cooperative diversity pro- 
tocols is proposed. Transmission in the GNAF protocol 
comprises of two phases - the broadcast phase and the 
cooperation phase. In the broadcast phase, the source 
broadcasts its information to the relays as well as the 
destination. In the cooperation phase, the source and the 
relays together transmit a space-time code in a distributed 
fashion. The GNAF protocol relaxes the constraints im- 
posed by the protocol of Jing and Hassibi on the code 
structure. In Part-I of this paper, a code design criteria 
is obtained and it is shown that the GNAF protocol is 
delay efficient and coding gain efficient as well. Moreover 
GNAF protocol enables the use of sphere decoders at the 
destination with a non-exponential Maximum likelihood 
(ML) decoding complexity. In Part-II, several low decoding 
complexity code constructions are studied and a lower 
bound on the Diversity-Multiplexing Gain tradeoff of the 
GNAF protocol is obtained. 

I. Introduction & Background 

Recently there has been a growing interest in coop- 
erative diversity techniques, wherein multiple terminals 
cooperate to form a virtual antenna array to leverage 
the spatial diversity benefits even if a local antenna 
array is not available. Since the works of [l]-[4], several 
cooperative transmission protocols have been proposed 
[5]-[10]. These protocols widely fall under two classes- 
Amplify and forward (AF) and Decode and forward 
(DF). The AF protocol is particularly attractive for 
two reasons - first, the operations at the relay nodes 
are considerably simplified, and second, we can avoid 
imposing bottlenecks on the rate by not requiring the 
relay nodes to decode [6]. Hence in this paper, we are 
mainly interested in the AF protocols although extension 
to the DF protocols can also be done. 

The contributions of this paper are 

• Propose a new AF protocol for cooperative diversity 
that generalizes the protocols of Jing and Hassibi 



[6], Nabar et. al. [7] and Azarian et. al. [5]. 

• It is shown using pair-wise error probability (PEP) 
analysis that a diversity order of R + 1 is possible 
with R relays, with the duration of the cooperation 
phase equal to R channel uses whereas the known 
protocols [9] need at least R + 1 channel uses. 

• It is shown that any square complex orthogonal 
design achieves full diversity when employed in 
GNAF protocols. 

• For the GNAF protocol, irrespective of the Dis- 
tributed Space-Time Code (DSTC) used in the 
cooperation phase, the average sphere decoder com- 
plexity is not exponential even if the destination has 
only one receive antenna. This was not the case in 
some of the earlier proposed protocols [9]. 

The paper is organized as follows: In Section [U] we 
describe the GNAF protocol and in Section[ll]] we derive 
a code-design criteria based on PEP analysis. Also, we 
point out the delay optimality, coding gain advantage 
and sphere decoding complexity advantage. Simulation 
results are provided in Section Hvl 

The proof for all the theorems, lemmas and claims are 
omitted due to lack of space. 

Notation: For a complex matrix A, A*, A T and A H 
denote the conjugate, transpose and conjugate transpose 
respectively. Aj denotes the real matrix obtained by 
taking the real parts of all the entries of the matrix A 
and Aq denotes the real matrix obtained by taking the 
imaginary parts of all the entries of the matrix A. For a 
square matrix B, \B\ and Tr (B) denote the determinant 
and trace of the matrix B respectively. 

II. GNAF Protocol Description 

We consider a wireless relay network with one source, 
R relays and a single destination D. The links between 
the different terminals are assumed to be fading links. 
The channel path gains from the source to the i th relay, 



denoted by /j and those from the j th relay to the 
destination denoted by gj are all assumed to be i.i.d 
CAf (0,1). The channel path gain, go from the source 
to the destination is also assumed to be CAf(0, 1). We 
assume that all the terminals are synchronized at the 
symbol level. Every transmission from the source to the 
destination comprises of two phases, i.e., the broadcast 
phase comprising of T\ channel uses and the cooperation 
phase comprising of T 2 channel uses 1 . We next describe 
three different versions of the GNAF protocol. 

GNAF-I: In the broadcast phase, the source terminal 
communicates with the relay and destination terminals. 
In the cooperation phase both the relays and the source 
communicate with the destination terminal. 

GNAF-II: The broadcast phase is same as GNAF- 
I protocol. In the cooperation phase, only the relays 
communicate with the destination terminal. 

GNAF-III: In the broadcast phase the source terminal 
communicates with the relays only. In the cooperation 
phase the source terminal as well as the relays commu- 
nicate with the destination terminal. 

In GNAF protocols, the relays are allowed to trans- 
mit only a complex linear combination of the symbols 
(received from the source terminal during the broadcast 
phase) as well as their conjugates. To perform these 
linear operations each relay is equipped with a pair of 
matrices called 'relay matrix pair'. 

The signal model for GNAF-I protocol is as shown in 
at the top of the next page, where 

- s is the vector transmitted by the source from a 
codebook consisting of c & = {si, S2, ■ ■ ■ , sl} com- 
plex vectors of size Ti x 1 satisfying E {s H s} = 1. 

- r.i denotes the received vector at the i relay and 
ti denotes the vector transmitted by the i th relay. 
At and Bi are complex matrices of size T2 x T\ 
satisfying || A, ||| + || Bi \\ 2 F < 1. The pair of 



matrices Ai and Bi will be called the 'relay matrix 
pair' for the i th relay. 
- yr>,i and yu.2 denote the received vector at the 
destination during the broadcast phase and coop- 
eration phase respectively. w\ and W2 represent the 
additive noise at the destination whose entries are 
i.i.d CAf(0, 1). The quantities 7Ti,7T2 and tt^ are the 
power allocation factors satisfying 7Ti + 112 + Rtt3 = 
T1+T2 so that P represents the total average power 
spent by the source and the relays together. 

The received vector at the destination can be written 



'The channel fade coefficients are assumed to remain constant for 
the entire duration (cooperation frame) of Tj + T2 channel uses and 
may vary independently from one cooperation frame to another. 
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(b) GNAF-I protocol 

Fig. 1 . Frame structure of NAF and special case of GNAF-I for a 
two relay network 



as follows 
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where S, H and W are as shown in © at the top of the 
next page. The DSTC in this case is the collection of all 
the (Ti + T 2 ) x 2(R + 1) matrices S and we call this 
DSTC as GNAF-DSTC. 

The NAF protocol and GNAF-I protocol: The NAF 

protocol proposed in [5] can be viewed as a special case 
of GNAF-I protocol. The frame structure of the NAF 
protocol for a two relay network is shown in Fig[0a). 
Here b\ and 62 represent the scaling factor used by the 
relays in order to meet their respective power constraints. 
By simply permuting the time slots of the NAF protocol, 
we get the frame structure shown in Figtnb). It is easy 
to observe that the frame structure in Fig[Hb) is a special 
case of the GNAF-I protocol with the relay matrix 



pairs being A\ 
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B l = 0; A 2 







and B2 = 0. Moreover, since the performance of a ML 
receiver is invariant to permutations of the received 
vector, it follows that the optimal DM-G tradeoff of the 
GNAF-I protocol is at least as good as the NAF protocol 
[5]. 

III. Code Design Criteria for GNAF-DSTCs 

To simplify the PEP analysis, we will require the noise 
components of rii = AiVi + BiV* to be uncorrected. 
Simplifying the expression for rii we get 



nn 




riiQ 





An + Bu 

AiQ + BiQ 



-A iQ + B iQ 
An — Bu 



m 

ViQ 



Vd,i 

t, 

VD,2 



/ ■jriP gos + Wi 

/ ■KxPfjS + Vi, V i = 1, . 



,R 



= 9iti + V^Pgo (AqS + BqS*) + W2 



(1) 



H 1 
W 



43^3 a 



■2('lF + l) 

91 fl 



AqS AiS 

9r/r 
wi 





A R s ^%l F 1 t 1) B s* B lS * 

gif* ■■■ gnf R ] 



ip+ 





B R s* 



(3) 



From the above equation, it is clear that the 
required conditions for the noise components 

of rii to remain uncorrected is that the matrix 

CT + B u -Ai Q + S iQ n 

[ A lQ + B iQ An - B u j 
property that E^Sf is a diagonal matrix V i 



should have the 
1,...,R. 



Theorem 1: Let Si be the transmitted codeword and 
Sj be some other codeword. Let A5* = Si — Sj and 
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where, is the maximum variance of the noise com- 
ponents of rii — AiVi + BiV* over all i = 1, . . . , R. 
If SiEf is a diagonal matrix V i = 1, . . . , R and if 
M = (A5) ff L>- 1 (A5) has rank < (R + 1), then for 
large i? and large P, the pairwise error probability (PEP) 
that a ML receiver erroneously decodes to Sj can be 
upper bounded as 
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where, ct^^j is the minimum non-zero eigen value of 
M. 
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For large R 



and P, the PEP can be shown to be upper bounded by 

PEP< { l } \I 2(R+1) + ^PMFF H r 1 . 

Given that rank(M) < (R + 1), for large P, it can be 
shown that 



PEP < 



-rank(M) 
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Hence the design criteria is to maximize the rank of 
AS. If we replace the identity matrix in the first column 
of AS by the all zero matrix, then it will correspond 
to GNAF-III protocol. It is easy to see that a diversity 
order of R + 1 is possible in the GNAF-III protocol if 
T 2 > (R+l). By setting T x = T 2 , tt 2 = and replacing 
the identity matrix in the first column of AS* by the all 
zero matrix, we get the Jing and Hassibi protocol [6]. 
Then, if T 2 > R, a diversity order of R can be obtained. 
In [6], Ai and Bi were restricted to real square matrices 
and a code design criteria was obtained only for a more 
restrictive case, i.e., for any i = 1, . . . , R, either Ai = 
or Bi = 0. For the general case, the authors conjectured 
that if T2 > R, a diversity order of R was possible. This 
conjecture has been proved here for a much more general 
case assuming that the number of relays is large. Further, 
the design criteria to achieve the required diversity order 
has also been obtained. 

Notice that the first column of AS* is not in the 
linear span of the other columns of AS*. Hence it is 
sufficient for the sub matrix obtained by deleting the 
fist column and first T\ rows of AS to satisfy full rank 
condition. Further note that even if we delete in addition 
the (R + 2) th column of AS containing the matrix Bq, 
the maximum achievable diversity order is not disturbed. 
We call the resultant sub-matrix the 'extended relay 
matrix' which is shown below. 
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Hence if T% > R, a diversity order of R + 1 can be 
achieved by GNAF-I as well as GNAF-II protocols. Thus 
T2 = R is the least possible delay. Notice an important 
fact that the source continuing transmission even in the 
cooperation phase is not mandatory for getting diversity 
R+l. However, it controls the coding gain as will be 
shown later in this section. 



The maximum diversity achievable by the protocol of 
[6] is only R. Therefore the new protocol increases the 
diversity order by one for the same number of relays. In 
[9], by some other modifications to the protocol in [6], a 
diversity order of R + 1 is obtained. The GNAF protocol 
is differs from that of [9], since here the destination 
processes a vector of size T\ + T 2 which is in contrast 
to processing a vector of size T 2 as assumed in [6], [8], 
[9]. Also the protocol in [8], [9] required T 2 > (R + 1) 
in order to achieve diversity R + l, whereas the GNAF- 
1,11 protocols require only T 2 > R. Hence the GNAF 
protocol is delay efficient. 

To highlight another advantage, let us consider R+l 
relays for the protocol in [6] and R relays for the GNAF- 
I,H and III protocols. Then the PEP expression derived 
in [6] for large P, is given by 



PEP < cP 



(5) 



where, c is some positive constant. Note that for any P, 
the negative exponent of P in the PEP (upper bound) 
expression @ for the GNAF protocol is slightly more 
than the corresponding term in (|5Jl. Hence the probability 
of error would decay faster as a function of P for the 
GNAF protocol. Intuitively, this happens because the 
transmission from the source to destination experiences 
only one channel fade coefficient whereas if the same 
transmission was made through the relay, then it would 
experience a product of two channel fade coefficients. 

The generalized sphere decoding algorithm can be 
applied to decode s from y. It can be observed that the 
size of vector y is always greater than the size of vector 
s for all practical choices of T\ and T 2 in GNAF-1,11 
protocols. This leads to an equivalent channel model 
which is never under-determined. Hence the average 
sphere decoder complexity does not become exponential 
for any choice of T\ and T2 (or equivalently any GNAF- 
DSTC). 

We will now give an example of a GNAF-DSTC 
which will achieve diversity R + 1 in GNAF-1,11 pro- 
tocols. 

Example 1: Let T X =T 2 = R= 4. Consider the 4 x 4 
complex orthogonal design which is shown below. 
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Let us write it in the format required for us. 
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It is easy to check that 



4(2:0, £1,2:2) 4(a;o, xi, £ 2 ) 

(Ei=o I x i l 2 ) J 4- Thus the design <d 4 ( Xo , XuX2 ) satisfies 
the required rank criteria and hence can achieve diversity 
5 in a 4 relay network. But note that only the rows of 
@4( Xo X1 X2 \ are orthogonal but not the columns. Hence 
the most sought after single symbol decodability feature 
of the complex orthogonal design is lost. 

In a similar manner, it can be easily shown that any 
square complex orthogonal design achieves full diversity 
in GNAF protocols. In [9], the use of orthogonal designs 
was suggested by doubling the number of symbols trans- 
mitted in the broadcast phase since complex conjugacy 
was not permitted at the relays in their protocol. However 
we see that if square orthogonal designs are employed, 
it is not actually necessary to do so. 

A. Coding Gain 

In this subsection, we shall further consider only a 
special case among the class of linear dispersion codes 
wherein for any i = 1 , . . . , R, either A; = or £>; = 
to show that GNAF protocol leads be higher coding gain 
compared to that of [6]. Without loss of generality, we 
shall assume that the first relays have £>; = and the 
remaining R — N relays have A t = 0. Further we assume 
that Bo = 0. Then the system model can be simplified 
to obtain 
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and S is given by the S in (0 with appropriate columns 
dropped. If M = (AS , ) H L»- 1 (A5) is full rank for all 
pairs of distinct codewords, then it can be shown that 
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Thus \M\ can be taken to be a measure of coding gain 
at high SNR. For very small P(P « 1) and large R, 
it can be shown that 

m P 2 
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where, M' = (AS) H (AS). Thus at low SNR, 
the design criteria is to maximize Tr(M'). 
Let AS be as shown in at the top of 

B N + 1 As* . . . B R As* 1 

and let M ER = (AS E r) h (AS E r)- For large P, we 



this page. Also, let 
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have 

/ \ R+l 

w = (rr^p-J + 11 (As) 1,2 |MEnl ) 

(10) 

which shows that the coding gain for GNAF-I pro- 
tocol (\M\) is more than that of GNAF-III protocol 
(\M\) and Jing-Hassibi protocol (\M E r\). Note that 
the coding gain for GNAF-I protocol (\M\) depends 
upon the matrix Aq, Thus the power allocation fac- 
tors 7Ti , 7T2 , 7T3 and the matrix Aq have to be care- 
fully chosen to optimize the coding gain. Further we 
have Tr (M 1 ) = Tr (m) + || (As) || 2 . Hence at 
low SNR also, GNAF-I protocol performs better. This is 
because the transmission by the source in the broadcast 
phase has been made use of at the decoder, and the 
source has been permitted to transmit all the time. 

IV. Simulation results 

In this section, we present simulation results of the 
CIOD for 4 relays to show that the GNAF-II protocol 
gives better error performance at all values of SNR 
compared to the Jing and Hassibi protocol [6]. The 
chosen values for the various parameters are T\ = T2 = 
R = 4, 7Ti = 2i±2i, tt 2 = and tt 3 = 2kt?k The 
signal set was taken to be QPSK rotated by 31.7175°. 
Fig|2] shows the ML decoder's symbol error rate and 
codeword error rate performance comparison for both 
protocols. Observe that performance in the GNAF-II 
protocol uniformly dominates the performance in Jing 
and Hassibi protocol. Also observe that the probability 
of error decays faster in the GNAF-II protocol than in the 
Jing and Hassibi protocol. This is because the GNAF-II 
protocol offers a diversity order of 5 as compared to 4 
by the Jing and Hassibi protocol. 
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